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(57) Abstract: A system and method of performing genomic 
research via online analysis of genomic data objects. A 
novel technical information model and implementation 
facility combines genomic visualization and analysis tools 
with the ability to link genomic data objects across (and 
within) plural databases. This provides for interactivity that 
is not possible using conventional Intemel access processes 
and also provides for data integration that further enhances 
the effectiveness of genomic research. A business process 
controls how the databases are accessed, particularly the 
commercial fcc-for-acccss databases, so that the researcher 
using a system according to the invention need not be 
concerned with keeping track of access limitations. 



WO 01/55911 PCT/US01/02527 



1 Title: INTEGRATED ACCESS TO BIOMEDICAL RESOURCES 

2 

3 BACKGROUND OF THE INVENTION 

4 1. Field of the Invention 

5 The present invention relates generally to the field of data processing. More 

6 particularly, the present invention relates to the use of the Internet to access a variety of 

7 biomedical data resources in an integrated fashion, in order to provide for increased 

8 efficiency in genomic research. 

9 2. Background Information 

10 Almost every cell of every living organism contains a complete set of instructions 

1 1 for creating that organism and regulating its cellular structures and activities over its 

12 lifetime. That set of instructions is called a genome. 

13 A genome is organized into distinct, microscopic units called chromosomes. 

14 Chromosomes are coiled threads of deoxyribonucleic acid (DNA). Each thread of DNA is 

15 composed of two long chains of nucleotides bound together in pairs to form a double 

16 helix. The human genome is made up of three and a half billion of these nucleotide pairs. 

17 A given DNA strand contains the cells' instructions for producing proteins. These 

18 instructions are in the form of specific sequences of nucleotide bases, called genes, within 

19 the DNA strand. Scientists estimate that 80,000 to 100,000 of these basic units of heredity 

20 exist within the human genome. Proteins perform a wide variety of physiological tasks. 

21 They facilitate processes such as digestion, breathing, immune responses, the production 

22 of heat and energy, and the movement of fluids in and out of cells. 

23 Most members of a species have the same collection of genes. However, each 

24 individual's unique characteristics stem from slight variations in the sequence of the 
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nucleotides that comprise the genes of that individual. These slight genetic variations that 
define unique characteristics of individuals are called polymorphisms. On average, the 
DNA of any two individuals in a species will differ by about 0.1%. 

Another class of variations - called mutations — also occurs. Both polymorphic 
and mutagenic variations may be harmful to an individual by inhibiting the production, or 
altering the normal function, of a protein. Most diseases result from these types of genetic 
variations. 

Genomics is the study of the nucleic sequences within a genome. The goal of 
genomic inquiry is to identify the sequence of nucleotides, understand the function of 
every gene they comprise, and clarify the genetic variations that define individuality and 
create disease. 

Genomics has a broad scope of applications. These range from the most basic of 
research endeavors to the promise of diagnostic usefulness. 

An important factor limiting the development of new drugs is the limited number 
of known target molecules for which new drugs can be developed. Disease target 
molecules are those that can be affected by a drug and cause a subsequent, desired 
biological reaction in the body. Historically, the process of discovering new target 
molecules has been extremely slow and very expensive due to reliance on trial-and-crror 
approaches to discovery. Genomic research will reduce the reliance on trial-and-error by 
enabling drug designers to go directly to target molecules of interest. Thus, applying 
genomic research to drug development should produce new and better drugs more quickly, 
and at a reduced cost. 

Another way that genomic research can help the pharmaceutical industry is in the 
emerging field of pharmacogenomics. Pharmacogenomics focuses on identifying genetic 
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variation among patients that may affect the efficacy of drug treatment - how well an 
individual's body absorbs and metabolizes a specific drug - in order to develop more 
personalized drug therapies. Nearly all drug companies are developing 
pharmacogenomics units as a reaction to increasing evidence that a given drug does not 
have the same effect on all people. 

In particular, pharmacogenomics is believed to offer at least three different useful 
applications: 

• Increasing the success rate of clinical trials by improving the process of patient 
population selection; 

• Identifying new uses for existing drugs; and 

• Rescuing drugs that have failed previous drug trials by identifying more 
appropriate populations for using the drug. Candidates for drugs to be rescued 
include those that produce adverse reactions in particular sub-populations. 
Molecular toxicology is another area of technology that can benefit from genomics 

research. Approximately 2.2 million Americans are admitted to hospitals every year as a 
result of adverse side effects from drugs. Over 100,000 Americans die annually from 
these adverse (and often unpredictable) effects. For instance, some cause liver damage, 
while others are harmful to the kidneys. Organ-specific gene expression profiles for drugs 
already available will enable researchers to study the toxicity of new drug compounds with 
more certainty. 

In addition, gene expression data, combined with polymorphism information 
related to metabolic pathways, will provide important indications of the way an individual 
patient will react to drugs of various dosage levels, thereby significantly reducing the 
unwanted side effects of therapy. 
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Risk assessment is a major area of diagnostics that will benefit from genomics. 
Historically, prediction of whether someone is at special risk for a particular disease has 
focused on measuring general indicators in the body, such as blood pressure and 
cholesterol levels. These measurements reflect general physiology but do not explain the 
specific genetic basis of disease in an individual patient. Consequently, these diagnostic 
tests do not discern the underlying cause of disease and can result in compromised medical 
care for patients and increased risk of litigation. 

New genomic-based diagnostics will focus on determining an individual's risk of 
developing a particular disease by looking at specific genes and any disease-related 
changes in that patient. These new diagnostics will likely lead to far better preventive care 
by offering more accurate assessments of a patient's potential risk for developing a 
particular disease. 

Personalized medicine is another major area of diagnostics that will benefit from 
genomics. Genomic information will be available to develop molecular diagnostic tests to 
identify the genetic make-up of individuals. These diagnostic tests will revolutionize 
medicine by enabling physicians to establish therapies designed for each patient, i.e., 
personalized medicine. 

For example, many types of cancer that are distinct at the cellular level 
nevertheless have similar symptoms. Because symptoms may be similar in one genetic 
type of cancer and another, it is important to know everything possible about cancer genes 
and their interactions in prescribing an effective treatment. 

As another example, physicians will be able to use a molecular/genomic test to 
help select the most effective drug with the minimum number of side effects. As a result, 
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this approach should benefit the patient with more customized care, reduced length of 
illness, and, ultimately, a better and longer life. 

Besides healthcare, the field of agriculture is also likely to benefit from genomic 
research. The ability to diagnose plant and animal diseases and develop treatments 
targeted against those diseases should produce better agricultural products and improve 
yields. For example, the comparison of genetic information from disease-or pest-resistant 
plant strains with non-resistant strains and the use of selective breeding programs for 
favorable traits will significantly increase the number and success of new strains available 
to various agricultural areas around the world. This has major implications for not only 
increasing the quantity of food but also its nutritional quality. 

Other fields that will likely derive important benefits from genomic information 
include forensics, veterinary medicine, textile production, waste control, and 
environmental remediation. 

A significant impediment to achieving any of the foregoing expected beneficial 
results of genomic research is the sheer size of the amount of genomic information to be 
sifted through and studied. Without exaggeration, the amount of raw nucleic sequencing 
data available to be sifted through and studied is unimaginably vast. Moreover, the body 
of data gets bigger every day (literally), since newly sequenced strands of DNA are 
documented on an ongoing basis. Because of the vastness of the genomic data, it is stored, 
handled, and manipulated via computers. 

Bioinformatics is the use of computers to retrieve, process, and analyze biological 
information. This field of data processing is now considered essential for drug discovery 
and development. Scientists are augmenting traditional "wet" biology with quantitative 
analyses, database comparisons, and computational algorithms. In this way, biology 
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1 research, at least preliminarily, is conducted in a virtual environment before the scientist 

2 sets foot in the laboratory. Bioinformatic tools and services assist pharmaceutical and 

3 biotechnology researchers with all phases of drug discovery and development including 

4 gene discovery, understanding disease pathways, identifying new disease targets and the 

5 discovery and correlation of gene sequence variation to disease. 

6 Unfortunately, conventional means of access to genomic information does not 

7 provide for comprehensive and easy access so that the data can be analyzed or studied in a 

8 computationally transparent manner. Genomic information is accumulated in a wide 

9 variety of databases, some public (freely available), some commercial (available for a 

10 price), and some proprietary ("in-house" resources that are not shared with outsiders). 

1 1 Referring to Fig. 1, genomic databases, both public 102 and private 104, as well as 

12 search tools 106 are available online. A user 110 uses an interface device 112 to access 

13 the databases 102, 104 and search tools 106. The interface device 112 communicates with 

14 a data access portal site 120 via an Internet connection 130. The portal 120 makes 

15 connections to the databases 102, 104 and search tools 106 via the Internet 140. Another 

16 operational mode for the user 1 10 to access the genomic research resources 102, 104, 106 

17 is serially via an Internet connection without using the portal 120. (This simplified mode 

18 of connection is not illustrated.) The interface device 112 is typically an implementation 

19 of a thin client browser application. 

20 Each database is made up of data objects that contain biological data. The data 

21 objects differ from one database to the next in terms of the types of biological data they 

22 contain, and in terms of their formats. Thus, studying data from plural databases requires 

23 the researcher to learn how to interpret the data as presented in the unique data structures 

24 and data contents of each particular database. This is a significant inconvenience. 

-6- 
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Thus, what is needed is a bioinformatics visualization tool that will automatically 
interpret data from diverse genomic databases (each containing genomic data objects of 
varying formats) so that it is presented to a user in a predictable, easily-recognizable 
format. 

Additionally, because of the disparate storage formats used by the different 
databases, many of the data objects to be analyzed using automated analysis tools must be 
converted from their native format into a format that is recognizable to the automated 
analysis tool to be used. This is a further inconvenience. 

Thus, what is needed is a bioinformatics software tool that will automatically 
translate data from diverse genomic databases (each containing genomic data objects of 
varying formats) so that it is presented to analysis facilities according to a uniform format. 

Furthermore, when it is discovered that a pair of data objects has a significant 
relationship to one another, there is no conventional mechanism (other than manually 
scribbling a note to oneself) for establishing a linking relationship between them. Such a 
mechanism is most conspicuously absent in the case where the pair of data objects are 
found in two entirely different databases. 

Thus, what is needed is a software facility that enables a user to establish linking 
relationships between data objects, even when those data objects are drawn from diverse 
databases and have diverse formats and types of content. 

Moreover, the existing way for accessing and navigating Hyper Text Markup 
Language (HTML) genomic data files provided by Internet database hosts or by a 
centralized Internet portal has crucial limitations. For one thing, there is no interactivity 
possible for manipulation and/or analysis of the data. This interactivity is crucial in terms 
of research effectiveness. 
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1 Thus, what is needed is a software facility that enables a user to interactively 

2 access and navigate genomic data files over the Internet, whether those files be in the form 

3 of HTML (as is typically done) or in other formats the data may appear. 

4 SUMMARY OF THE INVENTION 

5 It is an object of the present invention to provide a bioinformatics visualization tool 

6 that will automatically interpret data from diverse genomic databases (each containing 

7 genomic data objects of varying formats) so that it is presented to a user in a predictable, 

8 easily-recognizable format. 

9 It is another object of the present invention to provide a bioinformatics software 

10 process and system that will automatically translate data from diverse genomic databases 

1 1 (each containing genomic data objects of varying formats) so that it is presented to 

12 analysis facilities according to a uniform format. 

13 It is yet another object of the present invention to provide a software process and 

14 system that enables a user to establish linking relationships between data objects, even 

15 when those data objects are drawn from diverse databases and have diverse formats and 

1 6 types of content. 

17 It is still another object of the present invention to provide a software process and 

18 system that enables a user to interactively access and navigate HTML (or other format) 

19 genomic data files over the Internet. 

20 It is a further object of the present invention to provide a business process for 

21 enabling seamless access and interactivity with plural genomic databases, even in the case 

22 where one or more of those databases is a commercial (i.e., fee for access) database. 

23 Some of the above objects are made possible by a data processing system that is in 

24 electronic communication with a local genomic database system and with one or more 
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remote genomic database systems. The data processing system includes a graphical user 
interface that enables a user to view genomic data objects graphically. Ft also includes a 
genomic data object linker and a linkable data object resolver that resolves one or more 
genomic data objects, which are linkable with respect to a subject genomic data object, 
from among data objects found in the local genomic database system and the one or more 
remote genomic database systems. A resolved genomic data object that is resolved by the 
linkable data object resolver is linked to the subject genomic data object by the data object 
linker, so that the resolved genomic data object and the subject genomic data object are 
each provided to the graphical user interface. 

Some of the above objects are also made possible by a system for performing 
genomic research that is in electronic communication with a local genomic database 
system and with one or more remote genomic database systems. The system includes a 
means for presenting a user with a graphical view of genomic data objects, as well as a 
means for linking genomic data objects to one another. The system further includes a 
means for resolving a genomic data object with respect to a subject genomic data object, 
from among genomic data objects found in the local genomic database system and the one 
or more remote genomic database systems. A resolved genomic data object that is 
resolved by the means for resolving is linked to the subject genomic data object by the 
means for linking, so that the resolved genomic data object and the subject genomic data 
object are each provided to the means for printing. 

Another way that some of the above objects are made possible is by a method of 
performing genomic research with respect to a subject genomic data object, using a local 
genomic database and one or more remote genomic databases as resources. The method 
includes the act of resolving a linkable genomic data object, with respect to the subject 



WO 01/55911 PCT7US01/02527 



1 genomic data object, from among the local genomic database and the one or more remote 

2 genomic databases, regardless of the data formats of the genomic data objects. 

3 Additionally, the method includes the act of linking the subject genomic data object with 

4 the linkable genomic data object to form a set of linked genomic data objects. 

5 Furthermore, the method includes the act of storing the set of linked genomic data objects 

6 in the local genomic database. 

7 Still another way that some of the above objects are made possible is by a 

8 computer system for use in genomic research that implements a method as described 

9 above. 

10 One of the above objects is made possible by a novel technical information model 

1 1 and implementation facility. This is accomplished by combining genomic visualization 

12 and analysis tools with the ability to link genomic data objects across (and within) plural 

13 databases. 

14 Another of the above objects is made possible by a method of administering access 

15 to a plurality of genomic databases, where the genomic databases include a local genomic 

16 database, a public genomic database, and a commercial genomic database. The method 

1 7 includes a step of resolving linkable data objects with respect to a subject data object, and 

1 8 a further step of linking the resolved data objects to the subject genomic data object. The 

19 linkable data objects that are resolved from public genomic databases are resolved 

20 regardless of the data formats of genomic data objects stored therein, and without 

21 restriction as to access costs. The linkable data objects that are resolved from commercial 

22 genomic databases are resolved regardless of data formats of genomic data objects stored 

23 therein, and arc resolved subject to applicable, predetermined access agreements for the 

24 commercial genomic databases. 
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1 Still another way that some of the above objects are made possible is by a 

2 computer program product for enabling a computer to administer access to a plurality of 

3 genomic databases. The plurality of genomic databases includes a local genomic 

4 database, a public genomic database, and a commercial genomic database. The computer 

5 program product is software instructions for enabling the computer to perform 

6 predetermined operations, and a computer readable medium embodying the software 

7 instructions. The predetermined operations including the acts according to the methods 

8 discussed above. 

9 One aspect of the present invention is a local database for storing genomic data, 

10 such as nucleic acid sequences, amino acid sequences, oligonucleotides, results of Basic 

1 1 Local Aligned Search Tool (BLAST) searches, and entries from medical databases such as 

12 MEDLINE. 

13 Another aspect of the present invention is a visualization and analysis facility. 

14 Visualization and analysis is provide, preferably via dialog boxes, so that parameters may 

15 be set for BLAST searches; so that text-based searching may be made of a sequence 

16 database, and so that bibliographic search may be done of a database. A linker is included 

17 for linking together nucleic acid sequences, amino acid sequences, BLAST search results, 

18 MEDLINE entries, etc. All types of genomic/biomedical information are visualized via a 

19 graphical interface that incorporates viewer and editor components. 

20 Still another aspect of the present invention is an Internet connector having a 

21 programming module that resolves links between database objects. This connector looks 

22 either in the local database or in a remote Internet server to obtain a data object being 

23 sought. 
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1 BRIEF DESCRIPTION OF THE DRAWING 

2 Additional objects and advantages of the present invention will be apparent in the 

3 following detailed description read in conjunction with the accompanying drawing figures. 

4 Fig. 1 illustrates a conventional configuration of using a browser to access Internet 

5 database resources via a portal. 

6 Fig. 2 illustrates integrated access to Internet database resources according to an 

7 embodiment of the present invention. 

8 DETAILED DESCRIPTION OF THE INVENTION 

9 One way to view the present invention is that the traditional approach of using a 

10 thin client browser and a thin portal (refer to Fig. 1) to provide a researcher with access to 

1 1 genomic research resources is abandoned. In its place, applicants have discovered the 

1 2 increased effectiveness of a system that provides an interactive interface with the 

13 resources. This interactivity is crucial for increasing the effectiveness of genomic 

14 research. Another way to view the present invention is as a technical information model 

15 and implementation facility. The interactivity aspect of the invention is provided by the 

1 6 combination of linking of data objects and its visualization and analysis aspects. 

1 7 In addition to a local database for storing intermediate and finalized results, the 

18 present invention has a visualization and analysis aspect. Visualization is provide in an 

19 advanced form that shows the sequences and other molecules in graphical presentations 

20 that are intuitively appealing to the human perception. In addition to interactivity, the 

21 ability for the user to easily integrate (i.e., link) data and store the integrated data in the 

22 local database is very helpful. 

23 An additional functionality that is provided in a preferred embodiment of the 

24 present invention are full-scale analysis tools and algorithms directly available at the user 
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1 interface rather than remotely over the Internet. Full-scale analysis allows DNA to be 

2 evaluated in view of, protein, enzyme, and oligos data sets, BLAST results, MEDLINE 

3 Entrez data, and amino acids. 

4 An example of a software product that can provide the visualization and analysis 

5 aspects of the present invention is the VectorNTI ™ product of InforMax, Inc. of North 

6 Bethesda, Maryland. 

7 Another aspect according to a preferred embodiment of the present invention is the 

8 use of a resolving system that reaches out to plural databases over the Internet (e.g., NCBI, 

9 Entrez, PubMed, SRS) to provide integrated database searches. 

10 Referring to Fig. 2, databases, both public 202 and private 204, and various 

1 1 research tools 206 are available online. Also available as a research resource are the 

12 previous research results and other proprietary data that user 210 stores in a local database 

13 system 208. As in the prior art, the Internet 220 is used as a communication medium to 

14 access the various remote resources 202, 204, 206. However, in contrast with the prior art, 

1 5 an entirely different set of tools is used for conducting research. 

16 The user 210 utilizes a user interface 230 that includes a visualization system 232 

1 7 and a data set linking facility 234 (hereinafter "linker" for short). The linker 234 provides 

1 8 for integration of data sets that the user deems to be worthy of being associated with one 

19 another for further study in relation to one another. Data sets so linked may be more 

20 closely examined or analyzed using analysis tools and algorithms 240. Examples of useful 

21 analysis tools and algorithms to include for use with the present invention are BioPlot™, 

22 AlignX™, and ContigExpress™, which are all products of InforMax, Inc. of North 

23 Bethesda, MD. A number of other available analysis tools, such as BLAST may also be 

24 usefully employed. 
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1 This examination and analysis produces results that may themselves be linked to 

2 the data sets from which they were derived. The local database 208 is used to store the 

3 integrated data sets and results for later study by the user 210, or his or her colleagues. 

4 The later study may be in the form of additional computer analysis, or in a biology 

5 laboratory if the results are deemed to be sufficiently promising. 

6 Candidates for linking are identified by the linkable data object resolving system 

7 250. The linkable data object resolving system 250 connects via the Internet 220 to access 

8 any of various search tools 206 and databases 202, 204 to search for data objects that are 

9 relevant to a subject data object that the user 210 has identified as being of interest. The 

10 resolving system 250 does not establish links. Rather, the resolving system 250 identifies 

1 1 data objects from the enormously vast collections of data that are available for inspection 

12 over the Internet that should have a reasonable probability of being relevant to the subject 

13 data object. 

14 To limit and guide the searching by the resolving system 250, the permissioning 

15 and accounting module 260 directs the resolving system 250 to access only databases that 

16 are public 202 or those commercial databases 204 for which access agreements have been 

1 7 established. The accounting aspect of the permissioning and accounting module 260 

1 8 keeps records of access times, durations, and authorizations regarding the use of the 

1 9 commercial databasses 204. 

20 In addition to the apparatus aspects of the present invention, methods also form 

21 some aspects of the invention. A method of performing genomic research according to the 

22 present invention is accomplished, first, by resolving a linkable genomic data object, and 

23 then, by linking the linkable genomic data object with a subject genomic data object to 

24 form a set of linked genomic data objects. The resolving act is performed with respect to 
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1 the subject genomic data object, using one or more remote genomic databases as 

2 resources. Optionally, a local genomic database is also used in resolving a linkable 

3 genomic data object. The act of resolving is performed regardless of the data formats of 

4 the genomic data objects as they may be found in the various databases. After the 

5 genomic data objects are linked, they are preferably stored as a set of linked genomic data 

6 objects in the local genomic database. 

7 Another aspect of the present invention is that it represents a business process 

8 wherein predetermined access agreements for commercial databases are used to guide 

9 research steps so that accessing of these databases is entirely seamless from the point of 

10 view of the researcher/user employing the process. This results in a process of 

1 1 administering access to a plurality of genomic databases, public, commercial, as well as 

12 local (possibly proprietary). The process includes a step of resolving linkable data objects 

1 3 with respect to a subject data object, and a further step of linking the resolved data objects 

14 to the subject genomic data object. The linkable data objects that are resolved from public 

1 5 genomic databases are resolved regardless of the data formats of genomic data objects 

1 6 stored therein, and without restriction as to access costs. The linkable data objects that are 

1 7 resolved from commercial genomic databases are resolved regardless of data formats of 

1 8 genomic data objects stored therein, and are resolved subject to applicable, predetermined 

1 9 access agreements for the commercial genomic databases. 

20 The present invention has been described in terms of preferred embodiments, 

21 however, it will be appreciated that various modifications and improvements may be made 

22 to the described embodiments without departing from the scope of the invention. The 

23 scope of the invention is limited only by the appended claims. 
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WHAT IS CLAIMED IS: 

1 1 . A data processing system that is in communication with a local database system 

2 and that is in communication with one or more remote database systems, the data 

3 processing system comprising: 

4 a graphical user interface enabling a user to view data objects graphically; 

5 a data object linker; 

6 a linkable data object resolver that resolves one or more data objects, which are 

7 linkable with respect to a subject data object, from among data objects found in the local 

8 database system and the one or more remote database systems; 

9 wherein a resolved data object resolved by the linkable data object resolver is 

1 0 linked to the subject data object by the data object linker, so that the resolved data object 

1 1 and the subject data object are each provided to the graphical user interface. 

1 2. A data processing system that is in electronic communication with a local 

2 genomic database system and that is in electronic communication with one or more remote 

3 genomic database systems, the data processing system comprising: 

4 a graphical user interface enabling a user to view genomic data objects graphically; 

5 a genomic data object linker; 

6 a linkable data object resolver that resolves one or more genomic data objects, 

7 which are linkable with respect to a subject genomic data object, from among data objects 

8 found in the local genomic database system and the one or more remote genomic database 

9 systems; 

10 wherein a resolved genomic data object resolved by the linkable data object 

1 1 resolver is linked to the subject genomic data object by the data object linker, so that the 
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12 resolved genomic data object and the subject genomic data object are each provided to the 

13 graphical user interface. 

1 3. The data processing system of claim 2, wherein the data processing system 

2 communicates with the one or more remote genomic database systems via a network. 

1 4. The data processing system of claim 2, wherein the data processing system 

2 communicates with the one or more remote genomic database systems via a network of 

3 networks. 

1 5. The data processing system of claim 2, wherein the data processing system 

2 communicates with the one or more remote genomic database systems via the Internet. 

1 6. The data processing system of claim 2, wherein the resolved genomic data 

2 object and the subject genomic data object are each of a data type selected from the group 

3 consisting of: nucleic acid sequences, amino acid sequences, olgionucleotides, results of a 

4 BLAST search, and medical data. 

1 7. The data processing system of claim 6, wherein the resolved genomic data 

2 object and the subject genomic data object are of different data types. 

1 8. The data processing system of claim 2, wherein the local genomic database 

2 system and the one or more remote genomic database systems each contain data objects of 

3 types that are selected from the group consisting of: nucleic acid sequences, amino acid 

4 sequences, olgionucleotides, results of BLAST searches, and medical data. 
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1 9. The data processing system of claim 2, wherein the graphical user interface has 

2 the capability to graphically depict nucleic acid sequences, amino acid sequences, 

3 olgionucleotides, results of BLAST searches, and medical data. 

1 10. The data processing system of claim 2, wherein the genomic data object linker 

2 links genomic data objects that are of differing data types. 

1 1 1 . A system for performing genomic research, the system being in electronic 

2 communication with a local genomic database system and that is in electronic 

3 communication with one or more remote genomic database systems, the system 

4 comprising: 

5 means for presenting a user with a graphical view of genomic data objects; 

6 means for linking genomic data objects to one another; and 

7 means for resolving a genomic data object with respect to a subject genomic data 

8 object, from among genomic data objects found in the local genomic database system and 

9 the one or more remote genomic database systems; 

10 wherein a resolved genomic data object resolved by the means for resolving is 

1 1 linked to the subject genomic data object by the means for linking, so that the resolved 

12 genomic data object and the subject genomic data object are each provided to the means 

13 for printing. 

1 12. The system for performing genomic research of claim 1 1 , wherein the system 

2 communicates with the one or more remote genomic database systems via the Internet. 

1 13. The system for performing genomic research of claim 1 1, wherein the resolved 

2 genomic data object and the subject resolved genomic data object are each of a data type 
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selected from the group consisting of: nucleic acid sequences, amino acid sequences, 
olgionucleotides, results of a BLAST search, and medical data. 

14. The system for performing genomic research of claim 13, wherein the resolved 
genomic data object and the subject resolved genomic data object are of different data 
types. 

1 5. The system for performing genomic research of claim 11, wherein the local 
genomic database system and the one or more remote genomic database systems each 
contain data objects of types that are selected from the group consisting of: nucleic acid 
sequences, amino acid sequences, olgionucleotides, results of BLAST searches, and 
medical data. 

16. The system for performing genomic research of claim 1 1, wherein the means 
for presenting has the capability to graphically depict nucleic acid sequences, amino acid 
sequences, olgionucleotides, results of BLAST searches, and medical data. 

17. The system for performing genomic research of claim 1 1, wherein the means 
for linking links genomic data objects that are of differing data types. 

18. A computer system adapted to genomic research with respect to a subject 
genomic data object, using a local genomic database and remote genomic databases as 
resources, the computer system comprising: 

a processor, and 

a memory, in electronic communication with the processor, including software 
instructions adapted to enable the computer system to perform the steps of: 



- 19- 



WO 01/55911 



PCT7US01/02527 



7 resolve a linkable genomic data object, with respect to the subject genomic data 

8 object, from among the local genomic database and the remote genomic 

9 databases, regardless of the data formats of the genomic data objects; 

10 link the subject genomic data object with the linkable genomic data object to 

1 ] form a set of linked genomic data objects; and 

12 store the set of linked genomic data objects in the local genomic database. 

1 19. A method of performing genomic research with respect to a subject genomic 

2 data object, using a local genomic database and one or more remote genomic databases as 

3 resources, the method comprising: 

4 resolving a linkable genomic data object, with respect to the subject genomic data 

5 object, from among the local genomic database and the one or more remote genomic 

6 databases, regardless of the data formats of the genomic data objects; 

7 linking the subject genomic data object with the linkable genomic data object to 

8 form a set of linked genomic data objects; and 

9 storing the set of linked genomic data objects in the local genomic database. 

1 20. The method of performing genomic research of claim 19, wherein the resolved 

2 genomic data object and the subject genomic data object are each of a data type selected 

3 from the group consisting of: nucleic acid sequences, amino acid sequences, 

4 olgionucleotides, results of a BLAST search, and medical data. 

1 21. The method of performing genomic research of claim 20, wherein the resolved 

2 genomic data object and the subject genomic data object are of different data types. 
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1 22. The method of performing genomic research of claim 19, wherein the local 

2 genomic database system and the one or more remote genomic database systems each 

3 contain data objects of types that are selected from the group consisting of: nucleic acid 

4 sequences, amino acid sequences, olgionucleotides, results of BLAST searches, and 

5 medical data. 

1 23. The method of performing genomic research of claim 19, the method further 

2 comprising: 

3 providing a graphical user interface to graphically depict the subject genomic data 

4 object and the resolved genomic data object, regardless of whether they are nucleic acid 

5 sequences, amino acid sequences, olgionucleotides, results of BLAST searches, or medical 

6 data. 

1 24. The method of performing genomic research of claim 19, wherein the act of 

2 linking links genomic data objects that are of differing data types. 

1 25. A method of administering access to a plurality of genomic databases, the 

2 plurality of genomic databases including a local genomic database, a public genomic 

3 database, and a commercial genomic database, the method comprising: 

4 resolving one or more linkable data objects with respect to a subject data object; 

5 and 

6 linking the one or more resolved data objects to said subject genomic data object; 

7 wherein the one or more linkable data objects are resolved from public genomic 

8 databases, regardless of data formats of genomic data objects stored therein, and without 

9 restriction as to access costs; and 
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1 0 wherein the one or more linkable data objects are resolved from commercial 

1 1 genomic databases, regardless of data formats of genomic data objects stored therein, and 

1 2 subject to predetermined access agreements for the commercial genomic databases. 

1 26. The method of administering access to a plurality of genomic databases recited 

2 in claim 25, wherein the resolved genomic data object and the subject genomic data object 

3 are each of a data type selected from the group consisting of: nucleic acid sequences, 

4 amino acid sequences, olgionucleotides, results of a BLAST search, and medical data. 

1 27. The method of administering access to a plurality of genomic databases recited 

2 in claim 26, wherein the resolved genomic data object and the subject genomic data object 

3 are of different data types. 

1 28. The method of administering access to a plurality of genomic databases recited 

2 in claim 25, wherein the local genomic database system and the one or more remote 

3 genomic database systems each contain data objects of types that are selected from the 

4 group consisting of: nucleic acid sequences, amino acid sequences, olgionucleotides, 

5 results of BLAST searches, and medical data. 

1 29. The method of administering access to a plurality of genomic databases recited 

2 in claim 25, the method further comprising: 

3 providing a graphical user interface to graphically depict the subject genomic data 

4 object and the resolved genomic data object, regardless of whether they are nucleic acid 

5 sequences, amino acid sequences, olgionucleotides, results of BLAST searches, or medical 

6 data. ; 
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30. The method of administering access to a plurality of genomic databases recited 
in claim 25, wherein the act of linking links genomic data objects that are of differing data 
types. 

3 1 . A computer program product for enabling a computer to perform genomic 
research with respect to a subject genomic data object, using a local genomic database and 
remote genomic databases as resources, the computer program, product comprising: 

software instructions for enabling the computer to perform predetermined 
operations, and 

a computer readable medium embodying the software instructions; 
the predetermined operations including the acts of: 

resolve a linkable genomic data object, with respect to the subject genomic data 
object, from among the local genomic database and the remote genomic 
databases, regardless of the data formats of the genomic data objects; 
link the subject genomic data object with the linkable genomic data object to 

form a set of linked genomic data objects; and 
store the set of linked genomic data objects in the local genomic database. 

32. A computer program product for enabling a computer to administer access to a 
plurality of genomic databases, the plurality of genomic databases including a local 
genomic database, a public genomic database, and a commercial genomic database, the 
computer program product comprising: 

software instructions for enabling the computer to perform predetermined 
operations, and 

a computer readable medium embodying the software instructions; 

-23- 



WO 01/55911 



PCT/USO 1/02527 



8 the predetermined operations including the acts of: 

9 resolve one or more linkable data objects with respect to a subject data object; 

10 and 

1 1 linking the one or more resolved data objects to said subject genomic data 

12 object; 

13 wherein the one or more linkable data objects are resolved from public 

14 genomic databases, regardless of data formats of genomic data objects 

1 5 stored therein, and without restriction as to access costs; and 

16 wherein the one or more linkable data objects are resolved from commercial 

17 genomic databases, regardless of data formats of genomic data objects 

1 8 stored therein, and subject to predetermined access agreements for the 

19 commercial genomic databases. 
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